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Abstract 

The quantum basin hopping algorithm for continuous global optimisation combines 
a local search with Grover's algorithm, and can locate the global optimum using effort 
proportional to the square root of the number of basins. This article establishes that 
Jordan's quantum gradient estimation method can be incorporated into the quantum 
basin hopper, providing an extra acceleration proportional to the domain dimension. 
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1 Introduction 

Recent articles mE10ElE] nave studied the application of Grover's quantum algorithm |21E] 
to the general unconstrained global optimisation problem. Given a domain point and corre- 
sponding objective function value, we can dynamically construct a quantum oracle classifying 
any other domain point as either better or not. Using this oracle in Grover's algorithm pro- 
duces an implementation of pure adaptive search that is, an improving sequence of 
domain points, each uniformly distributed among all points better than the previous. De- 
tails in H] show that the effort required by this approach is on the order of the square root 
of the effort required by pure random search P] . 

Of course, for many problems, more sophisticated methods than pure random search can 
be used on a Turing computer. Typically in these methods the idea is to be "guided" toward 
optimal domain points by the objective function's "terrain" , that is, the shape of its graph. 
For instance, the multistart method is a simple hybrid of pure random search and a local 
descent search, in which independent exhaustive local searches are conducted from many 
randomly chosen initial points. The quantum basin hopper 0, a quantum analogue of the 
multistart method, combines the quadratic acceleration due to Grover's algorithm with the 
potentially huge acceleration over pure random search to be gained by using a local search. 

In practice, objective functions in continuous domains are usually differentiable, at least 
piecewise. Being guided by the terrain, in this case, will largely mean responding to the 
gradient, an analytic expression for which may or may not be available. Local optimisation 
techniques for such problems generally fall into two categories. Gradient methods, such 
as gradient descent and conjugate gradient, require explicit computation of the gradient at 
each sample point. On the other hand, gradient-free methods, such as the Nelder-Meade 
simplex method and simulated annealing, attempt to respond to the gradient in an indirect 
way, without requiring its explicit computation. (The methods named are all described 
in 52-) Much of the reason for gradient-free methods is that, when an analytic expression 
is available for the objective function but not for its derivative, the number of function 
evaluations required near a point to estimate the gradient increases linearly with the domain 
dimension. In many problems of practical interest, the domain dimension is large enough to 
make this unpractical. 
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The quantum complexity of estimating an objective function's gradient, based solely on 
function evaluations, is constant, rather than linear, in dimension. Specifically, 10 shows 
that the gradient can be estimated at a point using just one (superposed) evaluation of the 
objective function. In it is established, under mild regularity conditions, that the error 
in the gradient estimate can be reduced to any required level, by appropriately selecting the 
numerical encoding and calculation precision and other parameters of the gradient estimation 
algorithm. 

Therefore it seems that gradient information is cheap in quantum computation, and ought 
to be incorporated into local optimisation methods, and into the quantum basin hopper. This 
paper discusses that possibility. The difficulty is that, while |S] assumes the local search to 
act detcrministically, quantum gradient estimation produces in general a nontrivial super- 
position of computational basis states. This is not simply a matter of numerical error; the 
indeterminacy, together with the marking oracle used in the Grover search, causes unwanted 
entanglement between the search register and the gradient estimation register. We will argue 
that the error bound established in [Hj implies that the algorithm's performance is substan- 
tially unaffected by this entanglement. 

This paper's structure is as follows. Section|21presents the required details of the quantum 
basin hopper. In Section^ we consider the error introduced by the indeterminacy of quantum 
gradient estimation, and conceptually divide it into two different kinds of error, "large" and 
"small" . Section 0] shows that the amplitude of the "large" errors can be made too small to 
disrupt the quantum basin hopper. Section shows that the numerical scale of the "small" 
errors adequately restricts the effect of the entanglement. Section concludes the paper and 
speculates on related future research. 

2 The quantum basin hopper 

The quantum basin hopper algorithm is described in [5]. The specifically quantum part of 
the algorithm, denoted GBS(y, r, L) below, is nested within two loops which can be executed 
classically. 

Quantum basin hopper with parameters (A, M, L) 

randomly choose ieS 

descend from x to a local minimum x cand 

Vcand <~ f(x) 

repeat 

V * Vcand 
m «— 1 

repeat 

choose an integer r uniformly at random from {0, . . . , \m — 1] } 
set x cand and y ca nd by measuring the output of GBS(F, r, L) 
m <— Xm 
until y cand < Y or m > M 
until y cand > Y 

The quantum part GBS(K, r, L) simply uses Grover's algorithm with r rotations to seek a 
domain point whose containing basin extends below Y. (Basins are exlored using L iterations 
of the local search.) GBS(F, r, L) operates on a Hilbert space <g> 72., encoding the 

state of a system containing L + l registers storing points in the domain of /, and one register 
storing a value of /. It proceeds as follows. The state is initialised to \s) ® |0-d)® L (8) \0n), 
where \s) is the equal-amplitude state in T>. Then, the operator 

UfU d . L ■ ■ ■ u dA (u s uil ■ ■ ■ l \ L lj \ I jl .u. ■ ■ ■ v dtl y (1) 

is applied. Finally, the last two registers are measured, and their values are returned as x cand 
and y ca nd- The factors process data as follows: 

U f \x Q )®---®\x L )®\y) = \xQ)®---®\x L )®\y + f{x L )), 
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U s \x )®---®\x L )®\y) 



((2|s) (8\-I)\xo))®---®\x L )®\y), 

f |aj )®-"<8|a:£)®|»), y>Y, 
\ -\x )®---®\x L )®\y), 2/ < Y, 



U^xo) ® • • - ® \x L ) ® \y) 



(where, for unitarity, the addition y + f{x£) is modulo the ordinate register's storage capac- 
ity). The operator Ud,k updates the (k + l)th register on the basis of the fcth, representing 
a single iteration of the local minimisation. In j> , it is assumed to act deterministically, so 
that Ud.L ■ • • Ud,i maps the computational basis state 



where x\, . . . ,Xl is the local search's sample path when started at xq. 

3 Errors due to quantum indeterminacy in gradient es- 
timates 

In this article though, we assume instead that each operation Ud,k involves a single quantum 
gradient estimation, so that the operation (JJJ involves (2r + l)L of them. The effect of the 
indeterminacy inherent in the gradient estimation method on Grover's algorithm requires 
contemplation. The terminus of the local minimisation sample path resulting from each 
domain point will be a quantum superposition, peaked around a point near the bottom of 
the point's containing basin, but having a nonzero amplitude at every domain point in general. 
Accordingly, the marking oracle involved in Grover's algorithm, which is here intended to 
identify points in basins extending below Y, will always slightly entangle the register storing 
the sample path's origin with the circuitry estimating the gradient. 

The aim of this article is to demostrate that Grover's algorithm, and therefore the quan- 
tum basin hopper, can still function despite this entangling. This will be done by considering 
separately the effects of "large" errors and "small" errors in the gradient estimates. The error 
in a gradient estimate is simply the magnitude of the difference between the estimate and 
the true gradient, and it is considered "large" if it exceeds 8, and "small" otherwise, where 
8 is a positive constant of our choosing. 

4 Large errors 

If \g n ) is the nth gradient estimate, let \g' n ) denote the normalised projection of \g n ) onto the 
span of estimates with error no larger than 8. Theorem 1 of 6 implies that, for any positive 
e and 8, sufficiently careful quantum gradient estimation will ensure that \\\g' n ) — \g n )\\ < e - 
Let \<j>) represent the computer's state just before measurement at the end of GBS(y, r, L), 
and let \<f/) represent the corresponding state if the gradient estimates \g n ) are replaced with 
\g' n ). Clearly |||0') — \(f))\\ < (2r + l)Le, which can be made as small as required. Therefore 
we can safely ignore large errors. 

5 Small errors 

It remains only to show that the quantum basin hopper can tolerate small errors, that is, that 
if gradient estimate errors were uniformly bounded by a sufficiently small 8, the quantum 
calculation GBS(y, r, L) would locate a basin extending lower than Y with a high probability 
(for suitable rotation count r, and assuming such basins exist). 

Let / : D — > K be the objective function before discretisation. Suppose D C W, and let 
(D, fi) be a measure space with n(D) finite and / measurable. 



\x ) <g> |0) <g> 



® |0) ® |0) 



to the computational basis state 



® \xi) ® 



<8> \x L ) <g> |0), 
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Let di,(xo,Ui, . . . ,Ul) denote the Lth iterate of the local search, starting from xq, and 
assuming that the kth gradient estimate includes an error of Uk- Let 



W+ = {x : d L (x,0,...,0) >Y}, 
Wy = {x : d L (x,0, ...,0) =Y}, and 
W- = {x : d L (x,0,...,0) < Y}. 

For almost all Y, Wy will have measure zero. Ordinarily, for a wide class of local optimisation 
methods, dh is continuous in all its arguments, almost everywhere. Thus, letting B$ represent 
the closed ball around £ MP with radius S, the set 

Wy,s — {x : dz(x,Bs, . . . , Bg) contains elements less than and greater than Y} 

decreases, as S — > 0, to Wy- This means that we can make Wy,s as small as we like by choosing 
5 small enough, that is, by computing the gradient estimates with sufficient precision. 

Now suppose S discretises D, so that V is spanned by {|x) : x E S}. Partition S into 
S a , S/3 and S-y, where 

s f3 = snwy,«, 
s a = (s\s l3 )nw+, 
s 7 = (S\Sp)nW- 

Section [5] indicates that the quantum search involves L + 1 domain-point registers and 
one range value register; in practice, there will be extra storage for "scratch space" , including 
the register for the gradient estimation sampling grid. Taking all of this circuitry in account, 
define the Hilbert space O such that T> ® O represents the state all registers involved in the 
quantum search, with T> representing the register storing xq, and O representing all other 
registers. Let |0e>) represent the initialised state of O. 

Let 



N Q = 


\Sa\, 


\a) = 




)|0 o ) : 


x g s a }/\ 




N p = 


\Sp\, 


\0) = 




)|0 o ) : 


x G s }/ y 




N 7 = 


|5 7 |, 


|7> = 




)|0 o ) : 


x G S 7 }/ ^ 






G = 






UfU d . L 


■■■U d ,u 





Let 



so that GBS(F, r, L) simply applies U f U djL ■ ■ ■ U dA G r to \s) <gi |0 o ). 
Then 



G\a) = — l \a) + — v — \/3) + -^ 1 7 ) and 



oh) - -V^-Vg,^ ^+%-% ); 

however, G\f3) is not a tensor multiple of |0q), and represents an entangled state. Let \Pk) 
denote G fc 

Define 9, rj, ( € [0, tt/2] by 

2 Q N l 2 (N* " ^7) 2 2, (N a + N p /2) 2 

cos 9 — — cos r? = - — — ^ o cos C = - — — — . 

4jV Q iV 7 ' {N a + N 1 ) 2 -N 2 S NN a 

Define 

|X+) = ^ (e <fl |«> - |7» IX-> = ^= (e- ie |a) - |7» • 



4 



The Grover operation (not exactly a rotation in this context) acts on |x±) as follows: 
G\X+) = ^=U s U-^--U- 1 L U < U f U d .L---U d .,{e^\a)-\ 1 )) 



V2 



Us (e ie \a) + \ 7 )) 



\a) ( 2 e ie N a 2^N^ _ _ 

N N J y/2\ N N 



I "7 < 
V2\ 



|7) / 2e ie y/NJV; i 2N 7 ^ 



= 1- 



2,/N, 



N 



= r-e--| X+ ) + ^(e^v / ^+v^) 



where 



A^ - jVg + A^ 7 
N 



, and similarly 



G|x-> = re i "| X -) + ^^(e- ie A /jV:+ ^7) 



Therefore 



fe-i 



G fc |x+) - ^^"l^) + (e«V^+ V^) ^'eT^lft-i-i) and 

■ ^ 



G fc | X -) = rV fc %_) + 



A' 



Noting that the initial state |s) (8) |0e>) equals 



WN a +e^yN 1 )\ X+ ) 



'2N sin 



'2N sin 



(v/jV^+e*^)!*-) 




we have 
G k \s) ® 



IT 



'2N sin0 



(-e- ifc "(v^ + e"* V^7)lx+) + e*«(y/N^ + e lf> v^)lx->) 




NvNsmo „ v ' 

= f (0Vq sin(fc?7) + y]V^sin(fc?7 + 0))|t) - ("\//Va sin(fc?7 - 0) + ^/N^ sm{k-q))\a) 

/ sin V 

\Pk) 



fe-1 



a/A 7 sin^ 
A^ 




fc-i 



sin! 



(sin(fc?7 + C)|t) ~ sinOr? + C - 0)|a)) 




fc-i 



3=0 



In particular, noting that |t) is orthogonal to \a) and \(5) but not necessarily to \(3k) for 



k > 0, we have 



( 7 |G fe | s ) 



> 



' sin(fc?7 + C) 



k-1 



sin 


sin# 


r fc sin(fc77 + Q 


2 


sin 


sin6> 


r fe sin(fc?7 + Q 


i 


sin 


sin# 



r J sin 



2C-j»7)| 



(2) 



-fc-i 



sin(6> — 2£ — J77) for small fe 




1 — r k e ikri 
1 - re^ 



,-i(2C-e) 



1 - r"e 



1 - re~ lr i 



This bound is maximised when 



r] — i In r 
1 - re* 7 ' 



= In r sm(kr] + () + r\ cos(ki] + (), 



that is, when 



where 



1 2JN /Npcos(2(-8 + 
atn 



In r sin £ — r\ cos C 



r? 2 V / N (3 /Npsm(2( - 6 + 0) + InrcosC - jysinC ' 
# r?-«liir 



1 - re 17 * 

This reduces to the familiar situation when Np = 0; note that then we have 
2' 



cosry = 1 - 2-^-, 



C = »?/2, 



and the optimal k is ir/2r] — 1/2. 
More tractibly, from (J2J) we have 



(7| G k \s) > ^ (r k sin(fc77 + Q - 2k^N p /N 
If Np <^ N 1 <€. N a , then the usual estimate fc = (tt / 4) yj N / N y gives 
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thus if the number of iterations is chosen optimally, the error rate is on the order of yj Np /N 1 . 
This will usually be overshadowed by errors due to unknown target size, which are already 
known to be manageable 

To summarise, "small" errors complicate the required implementations of Grover's algo- 
rithm by introducing an entangling subset of the search space. In the usual search algorithm, 
the search space is partitioned into marked and unmarked regions; here we have instead a 
partition into marked, unmarked and entangling regions 1 . It is established here that if the 
entangling region is much smaller than the marked region, then the search algorithm will still 
succeed with a high probability. 



6 Conclusion and further work 

Taking, as a performance baseline, the pure random search algorithm [3] (simply, randomly 
sampling the domain repeatedly), the multistart method provides a large improvement, for a 
wide class of optimisation problems. Separately, implementing pure adaptive search ^] 
via Grover's algorithm 0] E] provides an acceleration on the order of the square root of 

lr The good, the bad and the ugly. 
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the domain size; this is impressive for a method not relying on any problem structure, but 
quickly lags behind multistart for moderately structured problems. In [5] it was shown how 
the quadratic acceleration due to Grover's algorithm, and the acceleration (in proportion 
to the number of domain points in the deepest basin) due to multistart, could be achieved 
simultaneously. 

It is now known that the quantum complexity of gradient estimation is smaller than its 
classical complexity by a factor which is linear in domain dimension |BJ I10| . The question 
naturally arises whether we can simultaneously claim the benefit of this linear acceleration, 
alongside the accelerations due to Grover's algorithm and to multistart. This paper provides 
the answer. Section describes the quantum basin hopper algorithm. Sections 03 to study 
the error introduced into the algorithm by the indeterminacy inherent in quantum gradient 
estimation, using a "divide-and-conquer" approach. Section [3] splits the error in gradient 
estimation into "large" and "small" errors. Section 0] shows that "large" errors are unprob- 
lematic due to their low amplitude. SectionEJshows that "small" errors only corrupt Grover's 
algorithm on a small subset of the search space, with only slight overall effect. 

To pursue this work further, a natural first step would seem to be to tighten the bounds 
established, in order to predict, for a given optimisation problem, what algorithm param- 
eters and computational precision should be used. A careful analysis of the performance 
of a quantum-gradient-estimate implementation of, say, a conjugate gradient method on a 
quadratic form might yield insights. 
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